Protocols for Stochastic Shortest Path Problems with Dynamic Learning by Vural Aksakalli

Author

  • Vural Aksakalli
Abstract

The research problem considered in this dissertation, in its most broad setting, is a stochastic shortest path problem in the presence of a dynamic learning capability (SDL). Specifically, a spatial arrangement of possible-obstacles needs to be traversed as swiftly as possible, and the status of the obstacles may be disambiguated (at a cost) en route. No efficiently computable optimal protocol is known and many similar problems have been proven intractable. Chapter 1 defines SDL in continuous and discrete settings, and introduces the Random Disambiguation Paths Problem (RDP), a continuous variant of SDL wherein a decision maker (DM) needs to swiftly navigate from one given location to another through an arrangement of disc-shaped possible-obstacles in the plane. At the outset, the DM is given the respective probabilities that the discs are truly obstacles and, en route, when situated on a disc’s boundary, the DM has the option to disambiguate the disc, i.e., learn at a cost if the disc is truly an obstacle. The central question is to find a protocol that decides what and where to disambiguate en route so as to minimize the expected length of the traversal. For any RDP instance, the continuous plane can be approximated by a graph (a lattice), and edges that intersect discs can be appropriately probabilistic, giving rise to the Discrete RDP Problem (DRDP), which is a special case of the well-known Canadian Traveler Problem (CTP) in the literature, but with statistical dependency among the edges. The chapter concludes with a comprehensive review of the litera-
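The lattice discretization described in the abstract can be illustrated with a minimal sketch. It is not the dissertation's construction: the 4-connected unit lattice, the function names (`segment_hits_disc`, `build_lattice`, `shortest_length`), and the single example disc are all assumptions made for illustration. The sketch marks each lattice edge with the discs it crosses, then computes two simple reference lengths: the shortest path if every disc turns out to be a false obstacle, and the shortest path that avoids every disc without disambiguating anything.

```python
import heapq
import math


def segment_hits_disc(p, q, center, radius):
    """Return True if the segment p-q passes strictly inside the disc."""
    (px, py), (qx, qy), (cx, cy) = p, q, center
    dx, dy = qx - px, qy - py
    seg_len_sq = dx * dx + dy * dy
    # Parameter of the point on the segment closest to the disc center.
    t = 0.0 if seg_len_sq == 0 else ((cx - px) * dx + (cy - py) * dy) / seg_len_sq
    t = max(0.0, min(1.0, t))
    nearest_x, nearest_y = px + t * dx, py + t * dy
    return math.hypot(nearest_x - cx, nearest_y - cy) < radius


def build_lattice(width, height, discs):
    """Map each undirected unit edge to the indices of the discs it crosses.

    `discs` is a list of (center, radius, probability) tuples; this layout
    is a hypothetical choice, not taken from the source.
    """
    edges = {}
    for x in range(width):
        for y in range(height):
            for nx, ny in ((x + 1, y), (x, y + 1)):
                if nx < width and ny < height:
                    hit = [i for i, (c, r, _prob) in enumerate(discs)
                           if segment_hits_disc((x, y), (nx, ny), c, r)]
                    edges[((x, y), (nx, ny))] = hit
    return edges


def shortest_length(edges, source, target, usable):
    """Dijkstra over unit-length edges whose disc list satisfies `usable`."""
    adjacency = {}
    for (u, v), hit in edges.items():
        if usable(hit):
            adjacency.setdefault(u, []).append(v)
            adjacency.setdefault(v, []).append(u)
    dist = {source: 0.0}
    heap = [(0.0, source)]
    while heap:
        d, u = heapq.heappop(heap)
        if u == target:
            return d
        if d > dist.get(u, math.inf):
            continue  # stale heap entry
        for v in adjacency.get(u, []):
            nd = d + 1.0
            if nd < dist.get(v, math.inf):
                dist[v] = nd
                heapq.heappush(heap, (nd, v))
    return math.inf


# Hypothetical instance: one disc that is a true obstacle with probability 0.4.
discs = [((3.0, 2.0), 1.5, 0.4)]
edges = build_lattice(7, 5, discs)
source, target = (0, 2), (6, 2)

# Every disc assumed to be a false obstacle: all edges are traversable.
optimistic = shortest_length(edges, source, target, lambda hit: True)
# No disambiguation at all: only edges that cross no disc are used.
pessimistic = shortest_length(edges, source, target, lambda hit: not hit)
```

Under these assumptions the two values give a rough frame for the discrete problem. Avoiding every disc is itself a valid (if conservative) protocol, so its length is an upper bound on the best expected traversal length, while the obstacle-free shortest path is a lower bound. Choosing what and where to disambiguate between those extremes is the question the abstract poses, and the sketch does not attempt it.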


Similar resources

The BAO* algorithm for stochastic shortest path problems with dynamic learning

Suppose a spatial arrangement of possible obstacles needs to be traversed as swiftly as possible, and the status of the obstacles may be disambiguated en route at a cost. The goal is to find a protocol that decides what and where to disambiguate en route so as to minimize the expected length of the traversal. We call this problem the Stochastic Shortest Path Problem with Dynamic Learning (SDL),...


The Reset Disambiguation Policy for Navigating Stochastic Obstacle Fields

The problem we consider is a stochastic shortest path problem in the presence of a dynamic learning capability. Specifically, a spatial arrangement of possible obstacles needs to be traversed as swiftly as possible, and the status of the obstacles may be disambiguated (at a cost) en route. No efficiently computable optimal policy is known, and many similar problems have been proven intractable....


Dynamic Multi Period Production Planning Problem with Semi Markovian Variable Cost (TECHNICAL NOTE)

This paper develops a method for solving the single-product, multi-period production-planning problem in which the production and inventory costs of each period are concave and backlogging is not permitted. It is also assumed that the unit variable cost of production evolves according to a continuous-time Markov process. We prove that this production-planning problem can be stated as a ...


ALGORITHMS FOR BIOBJECTIVE SHORTEST PATH PROBLEMS IN FUZZY NETWORKS

We consider biobjective shortest path problems in networks with fuzzy arc lengths. Considering the available studies for single objective shortest path problems in fuzzy networks, using a distance function for comparison of fuzzy numbers, we propose three approaches for solving the biobjective problems. The first and second approaches are extensions of the labeling method to solve the sing...


Solving Stochastic Shortest-Path Problems with RTDP

We present a modification of the Real-Time Dynamic Programming (RTDP) algorithm that makes it a genuine off-line algorithm for solving Stochastic Shortest-Path problems. Also, a new domain-independent and admissible heuristic is presented for Stochastic Shortest-Path problems. The new algorithm and heuristic are compared with Value Iteration over benchmark problems with large state spaces. The r...



Publication date: 2007